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Abstract: Consider n unit intervals, say [1, 2], [3, 4], ... , [2n — 1, 2n\. Identify their end- 
points in pairs at random, with all (2n — 1)!! = (2n — l)(2n — 3) • • - 3 • 1 pairings being 
equally likely. The result is a collection of cycles of various lengths, and we investigate the 
distribution of these lengths. The distribution is similar to that of the distribution of the 
lengths of cycles in a random permutation, but it also exhibits some striking differences. 



1. Introduction 

Our concern in this paper is with a process for randomly joining intervals into cycles. 
Consider n unit intervals, say [1, 2], [3, 4], ... , [2n — 1, 2n]. Identify their endpoints in pairs 
at random, with all (2n — 1)!! = (2n — l)(2n — 3) ■ ■ - 3 • 1 pairings being equally likely. The 
result, which we call a random n-cyclation, is a collection of cycles of various lengths, and 
we investigate the distribution of these lengths. 

This process may be considered as a model for a chemical process of cyclation, whereby 
n molecules of a bilaterally symmetric monomer such as ethane, C2H6, are dehydogenated 
to form cyclic polymers such as cyclic polyethylene, (CH2)^"^. (The assumption that all 
pairings of endpoints are equally likely is not realistic, since the likelihood of endpoints to 
pair will be influenced by their physical proximity. We are, however, following a tradition in 
combinatorics going back to Cayley [C], in using an idealized model to give our calculations 
a chemical interpretation. The paper just cited, for example, overcounts the isomers of 
the paraffin scries, since the trees enumerated may have an exponentially growing number 
of vertices within a given distance of a particular vertex, whereas a molecule in three- 
dimensional space may have only a cubically growing number of atoms within a given 
distance of a particular atom.) 

If the intervals to be joined were oriented, with for example 2m — 1 being the tail 
and 2m being the head of the interval [2m — 1, 2m], and if they had to be joined head- 
to-tail consistently with this orientation, there would then be just nl = n ■ {n — 1) ■ ■ - 2 ■ 1 
consistent pairings, and the distribution we seek would be that of the lengths of cycles in 
a random n-permutation. (See Goncharov [Gl, G2] for pioneering work on this distribu- 
tion.) The distribution of cycle lengths for random cyclations is similar to that for random 
permutations, but it also exhibits some striking differences. 

In Section 2 wc shall study random cyclations primarily as regards their number of 
cycles, largely disregarding the lengths of these cycles. (In this respect, we are studying 
the topology of random cyclations, rather than their geometry.) 

The number of n-permutations with k cycles is a Stirling number of the first kind, 
which (following Knuth [K2]) we denote [^] . If 

*n(0= E 

l<fe<n 

denotes the generating function for n-permutations according to their number of cycles, 
then we have 

*n(o = e(e+i)---(e+n-i) (i.i) 
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(see Knuth [K2], 1.2.9, (27)). Setting ^ = 1 yields the obvious 



E 

l<k<n 



n 

k 



nl 



Let us denote by [ [^] ] the number of n-cyclations with k cycles, and by 



l<k<n 





n 
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the genrating function for n-cyclations according to their number of cycles. We shall show 
in Section 2 that 



*n(r7) = 2-$,(r7/2) 

= r/(r/ + 2)---(r/ + 2n-2). 

We shall first give a combinatorial proof of (1.2) by establishing the identity 



(1.2) 
(1.3) 
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(1.4) 



We shall also present an analytic argument (inspired by a result of Harer and Zagier [H]) 
that gives (1.3) directly, and also yields the following curious analogy: n-cyclations are to 
n-permutations as the (n — l)-sphere is to the (n — l)-simplex. In any case, setting r] = 1 
in (1.3) yields the obvious 

^ [M]=(2„-i)n. 

These results allow us to calculate the probabilities of the extreme cycle structures. 
For the largest possible number n of cycles, we have [^] = [[^]] = 1 from (1.1) and (1.3), 
so the probability that a random n-permutation or n-cyclation has n cycles is 1/n! or 
l/(2n — 1)!!, respectively. For the smallest possible number 1 of cycles, we encounter our 
first surprise: from (1.1) and (1.3) we have [l\ = (n- 1)! and = 2"-^ (n- 1)!. Thus 
the probability that a random n-permutation or n-cyclation has just a single cycle is 1/n 
or 2""-^ (n - l)!/(2n - 1)!!, respectively. We have 



2n-l _ 

(2n- 1)!! 



)2n-l 



1 /^N 

2 \n) 



{n ~ 1)! n! 
(2^^)! 

1/2 
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(where we have used the identity (2n— 1)!! = (2n)!/2"' n\ and Stirhng's asymptotic formula). 
Thus the probabihty of having just a single cycle is substantially larger for cyclations than 
for permutations. 

These results also allow us to calculate the expected number of cycles (as well as the 
higher moments, if desired) in random permutations and cyclations. From (1.1) and (1.3), 
we have 

n\ ^ m 

l<m<n 
= Hn 

= logn + 7 + 0(- 



where 7 = 0.577. . . is Euler's constant (see Knuth [K2], 1.2.7, (3)), and 

(2n-l)!! ^ 2m -1 

^ ^ l<m<n 

= H2n — \Hn 

1 7/1 

= -logn + log2 + ^ + - 

2 2 \n 

(All logarithms in this paper are to the natural base e = 2.718 ) 

In Section 3 wc shall study the lengths of the cycles in random cyclations. Our 
approach is modelled upon that used by Shepp and Lloyd [S] to study the cycle lengths in 
random permutations. We summarize their approach as follows. 

From an exact expression (formula (2.1) below) for the number of n-permutations 
with cycles structure (zi, 12-, • • • , in) (that is, with ii cycles of length 1, 12 cycles of length 
2, . . . , and cycles of length n), it is apparent that the cycle structure of a random 
n-permutation has the following interpretation: let ii, ^2, . . . , ^n be independent Poisson- 
distributed random variables with means 1, 1/2, . . . , 1/n, respectively, and condition on 
the event that cycles with these lengths contain altogether n elements (that is, that n = 
^KKn ^*')- Equivalently, we may let ii, i2, . . . . . ■ he independent Poisson-distributed 
random variables with means 1, 1/2, . . . , 1//, . . . , respectively, and condition on the event 
that n — X]/>i lii)- (This event now has probability 0, since it implies that in+i = in+2 = 
■ • ■ = 0.) Shepp and Lloyd hit upon the idea of letting ii,i2, . . . ,ii, . . . be independent 
Poisson-distributed random variables, without any conditioning. If this were done with the 
means indicated above, the sum u = ^;>i Hi, representing the total number of elements 
in all the cycles, would be infinite with probability 1. Thus they fix < ^ < 1 and use 
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the means z, z^/2,...,z^ /I,.... This results in v having a geometric distribution: the 
probabihty that v = nis {1 — z) z'^, and (1 — z)/{l — zQ is the generating function for u. 
Furthermore, when i/ = n, the conditional distribution of the cycle structure (^l, ^2, • • • , in) 
is that of a random n-permutation. (Since n = is possible in this model, we should 
agree that there is just one 0-permutation, which of course has no cycles.) This allows 
asymptotic information about the cycle structure as n — > oo to be obtained by applying 
Tauberian theorems to results concerning the limit 2; — > 1. Shepp and Lloyd show, among 
other things, that the expectation of the length L„ of the longest cycle in a random n- 
permutation has the asymptotic behavior 



POO 

Jo 



where 



00 



E{x)= I —dy, 



y 

and g-E(x)-x _ Q g243 . . . , while the expectation of the length Sn of the shortest 
cycle in a random n-permutation satisfies 

E:K[Sn] ~ e"^' logn, 

where e~'^ = 0.5614 (We note that the function E{x) can be expressed as the in- 
complete gamma-function r(0, x), or in terms of the exponential integral, as — Ei(— x); see 
Lebedev [L], (3.1.3).) 

For n-cyclations, we have a similar interpretation, with the means of the independent 
Poisson-distributed random variables being 1/2, 1/4, . . . , 1/2Z, . . . before conditioning. We 
then let the means be z/2, 2^/4, . . . , ;2^/2/, . . . without conditioning. The number v = 
"^lyi lii of intervals then has a negative-binomial distribution: the probability that ly = 

n is (1 — 2;)^/^ (^") (2/4)", and ((1 — z)/{l — zQ)^^'^ is the generating function for u. 
Furthermore, when u = n, the conditional distribution of the cycle structure (zi, i2, . . . , in) 
is that of a random n-cyclation. (We agree of course that there is just one 0-cyclation, 
and thus that (—1)!! — 1.) Again applying Tauberian theorems, we encounter our second 
surprise. We shall show in Section 3 that the expectation of the length M„ of the longest 
cycle in a random n-cyclation satisfies 

/•oo 

Ex[M„] - / e-^^^^/^-^dxn, (1.5) 
Jo 
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where 



/ 

^0 



oo 



while the expectation of the length of the shortest cycle in a random n-cyclation satisfies 



where tt = 3.1415... is the circular ratio, and (7r^/^/2) e^^^^/'^~^ dx — 1.4572.... 
Thus the expected length of the shortest cycle is substantially larger for cyclations than 
for permutations. 

2. The Number of Cycles 

In this section we shall give two proofs of (1.3). Each proof in fact establishes the 
connection (1.2) between cyclations and permutations. 

The first proof esablishes the identity (1.4). We begin by deriving a formula for the 
number of n-permutations with k cycles. Consider the n-permutations with cycle 
structure (ii, Z2, . . . , in), where n = Xli<z<n^^^ ^ ~ Si<z<n^^ Such a permutation 
can be specified as follows. Arrange the elements 1, 2, . . . , n in some order from left to right 
(which can be done in n\ ways). Partition these elements into blocks by forming ii blocks 
of size 1 from the leftmost ii elements, 12 blocks of size two from the next 2z2 elements, 
and so forth, with the last n elements forming blocks of size n. Form a cycle of the 
permutation from each block by letting the successor of each element of the block except 
the rightmost be the next element to the right, with the successor of the rightmost element 
of the block being the leftmost element of that block. It is clear that every n-permutation 
with cycle structure (zi, Z2, . . . , in) can be obtained in this way. Indeed, since the same 
n-permutation is obtained if the block of elements forming a cycle of length / are cyclically 
permuted among themselves (which can be done in / ways) , or if the blocks of elements 
forming the cycles of length / are permuted among themselves (which can be done in ii\ 
ways), there are exactly 



left-to-right orders that yield any given n-permutation with cycle structure {ii,i2, ■ ■ ■ ,in)- 
Thus if [zi, Z2, . . . , in] denotes the set of n-permutations with cycle structure {ii,i2, ■ ■ ■ , in) 
(it is in fact a conjugacy class in the symmetric group on n elements), then we have 




(1.6) 



n 



KKn 



#[ii,i2, ...,in] 



n 



(2.1) 



KKn 
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and thus 

n 
k 



l<Z<n 

where the sum is over aU cycle structures {ii,i2, • • • ,'in) such that n — X^i<i<„^^/ and 
^ ^ J2l<l<n ^^ 

Next we derive an analogous formula for the number [[^]] of n-cyclations with k 
cycles. Consider the n-cyclations with cycle structure (zi, ^2, ■ ■ ■ , in), where n — ^i<;<„ I H 
and k — J2i<i<n'''i- Such a cyclation can be specified as follows. Arrange the elements 
1, 2, . . . , n in some order from left to right (which can be done in n! ways), and add primes 
to some subset of these elements (which can be done in 2"^ ways). Replace each unprimed 
element m by the interval [2m — 1,2m] with its smaller endpoint 2m — 1 to the left of 
its larger endpoint 2m, and replace each primed element m' by the interval [2m — 1, 2m] 
with its larger endpoint to the left of its smaller endpoint. Partition these intervals into 
blocks by forming ii blocks of size 1 from the leftmost ii intervals, ^2 blocks of size two 
from the next 2z2 intervals, and so forth, with the last nin intervals forming z„ blocks 
of size n. Form a cycle of the cyclation from each block by letting the right endpoint of 
each interval of the block except the rightmost be paired with the left endpoint of the 
next interval to the right, with the right endpoint of the rightmost interval of the block 
being paired with the left endpoint of the leftmost interval of that block. It is clear that 
every n-cyclation with cycle structure (ii, ^2, ■ ■ ■ , in) can be obtained in this way. Indeed, 
since the same n-cyclation is obtained if the elements in a block giving rise to a cycle of 
length I are cyclically permuted among themselves (which can be done in / ways), or if 
they are cyclically permuted after reversing their left-to-right order and complementing 
their primed-or- unprimed status (which can be done in another I ways), or if the blocks of 
elements giving rise to the cycles of length I are permuted among themselves (which can 
be done in ii\ ways), there are exactly 

n (20^' 

l<Z<n 

left-to-right orders and choices of primed elements that yield any given n-cyclation with 
cycle structure (ii, ^2, • • • , in)- Thus if [[ii, 12, ... , in]] denotes the set of n-cyclations with 
cycle structure {ii,i2, • • • lin), then we have 

KKn 
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n 











and thus 

y- "'^^ (24) 

l<l<n 

where the sum is over aU cycle structures {ii,i2, ■ ■ ■ jin) such that n = X^i<i<„^^z and 

^ ^ Sl</<n^'- 

Comparing (2.2) with (2.4) yields (1.4). It is easy to see that this proof of (1.4) 
is "bijective", in the sense that the derivations of (2.2) and (2.4) can be used to give an 
explicit combinatorial correspondence between appropriately defined sets of size 2" and 
2^^ [[fc]]- Multiplying by r}'^ and summing over 1 < k <n then yields (1.2). Finally, from 
(1.2) and the definition of $n(0 we obtain (1.3). 

We shall now give a second, analytic rather than bijective, proof of (1.3). The starting 
point for this proof is a rather circuitous proof of (1.1). We begin by observing that since 
both sides of (1.1) are polynomials in ^, it will suffice to establish it for ^ a positive integer. 
To this end, we shall consider the number of n-permutations in which each cycle is colored 
with one of ^ colors. Since a permutation with k cycles can be colored in ways, the 
number of such colored n-permutations is $n(C)- We can count the number of colored 
permutations in another way as follows. We begin by assigning a color to each of the 
elements 1, 2, . . . , n. We then arrange the elements into cycles in such a way that all the 
elements in a given cycle have the same color. We then sum the number of permutations 
obtained over all ways of assigning colors to elements. There are (^.^ ^.^) ways of 
assigning the first color to ji elements, the second color to j2 elements, and so forth, with 
elements receiving the ^-th color. (We must of course have n = X]i<q,<^ ja-) Given such 
a distribution of colors, there are Jq! ways of arranging the ja elements receiving color a 
into cycles (that is, permuting them among themselves). Thus 

^n(o= E (. . .) n (2-5) 



where the sum is over all compositions {ji,j2, ■ ■ ■ jj^) of n into ^ non-negative parts. From 
the formula 



we obtain 



n! 



Jo Jo 

"OO POO 



which is (1.1), since the number of compositions of n into ^ non-negative parts is ("^f 7^) = 
(n + e-l)!/(e-l)!n!. 

If we did not know the formula (2.6), we could still proceed as follows. Replace each 
factorial in (2.5) with its integral representation j\ = dx. This yields 

E (. " .) 

(31,32, ■■■,Ji) 

fOO /-OO foo 

/ • ■ • / xi'xi' • • • e-^^i+^2+-+^«) dxi da;2 • ■ • dx^ 
Jo Jo 

= / •■•/ {xi + X2 + --- + x^)''e-^'''+'''+-+''^Uxidx2---dx^, 
Jo Jo Jo 

by the multinomial theorem. Since the integrand is now a function of s = Xi+X2 + - ■ ■ + x^, 
we obtain 

pOO 

^n(e)= / s^e-' s^-'ds Vol^_i(A^_i) 
Jo 

= (n + e-l)!Vol^_i(A5_i) 
= e(e + l)---(e + n-l), 

where 

Vol^_i(A^_i) = l/(e-l)! (2.7) 

denotes the (^ — l)-dimensional volume of the (^ — 1) -dimensional simplex A^_i defined 
by the equation xi + X2 + ■ • ■ + = 1 and the inequalities a;i > 0, a;2 > 0, . . . , a;^ > in 
^-dimensional space. Thus we again obtain (1.1). (We observe that the identity (2.7) can 
be derived in the following way. Multiplying ^ copies of the identity 1 = dx we 

obtain 

poo poo poo 

1= / • • • / e-(^i+^^+-+^«) dxi dx2--- dx^. 

Jo Jo Jo 

This multiple integral can be evaluated like the preceding one, 

poo poo poo poo 

/ / •••/ e-(^i+^2+-+^«)dxia;2---dx^ = / e"" s^"^ Vole_i(A^_i) 
Jo Jo Jo Jo 

= (e-l)!Vole-i(A^_i). 



Combining these two evaluations of the multiple integral yields (2.7).) 

Next we give an analogous derivation of (1.3). It will suffice to establish it for r] an 
even positive integer. By considering the number of ways of assigning one of 77 colors to 
each cycle of each n-cyclation we obtain 

E i," ,) n 

I - ■ \ \Jli J2i ■ ■ ■ 1 Jr)/ 1^0^^ 
{31,32, ■■■,]r,) 1<P<'? 

We now use the integral representation (2j — 1)!! = (l/27r)^/^ y^^ e"^^ dy to obtain 

/OO pOO rOO 
/ • ■ • / vV'vl'' ■ ■ -y?" e--^^y"+y-+-+y"r,) dy, dy2 ■■■dyr, 
-00 J —00 J —00 

= T^y^2 / / ••• / (2/i+yi + --- + y?)"e-^(^?+^^'+-+^^)d|/id|/2---rf2/,. 

Since the integrand is now a function of r = {yf + 2/1 + ' " ' + y^V^'^i we obtain 

1 r°° 

n— 1 roo 

/ s"e-" s^/^-^ds Vol^_i(5^_i) 
Jo 

Vol^_i(5^_i) 



_ 2 

_ 2"-i (77/2 + 71-1)! 

- 

_ 2"(?7/2 + n-l)! 
- 1)! 

= r/(r7 + 2)---(?7 + 2n-2), 
where we have made the substitution r = (2s)^/^, and 

2^^ 

Vol,_,(5,_0 = ^-^^ (2.8) 

denotes the (ry — l)-dimensional volume of the (77 — 1) -dimensional sphere 5*^-1 defined by 
the equation + ?/| + • ■ • + = 1 in ?7-dimensional space. Thus we again obtain (1.3). 
(We observe that the identity (2.8) can be derived in the following way. Multiplying 77 



copies of the identity (277)^/2 = e'^y^ dy we obtai 



cam 



(2 



/OO poo poo 
/ •••/ e-^^y^^+y^^+-+yvUyidy2---dyrj. 
-00 J —00 J —00 



("OO 

/ e-' s'l/^-Us Vol^_i(5^_i) 
Jo 



This multiple integral can be evaluated like the preceding one, 

/oo POO PCX) 
/ . . . / dyi dy2 ■■■dyr, 

-oo J —oo J — oo 

POO 

Jo 

= 2''/2-i (r//2-l)!VoVi(Vi)- 
Combining these two evaluations of the multiple integral yields (2.8).) 

3. The Lengths of Cycles 

In this section we shall derive (1.5) and (1.6), establishing the asymptotic behaviors of 
the expectations Ex[M„] and Ex[T„] of the lengths M„ and of the longest and shortest 
cycles, in a random n-cyclation as n — > oo. This will be done in three steps. First, we 
shall introduce a new notion of random cyclation, we we call a random 2;-cyclation, where 
< 2; < 1. For a random 2;-cyclation, the total number of intervals is itself a random 
variable i/. If however we condition on the event v = n, the conditional distribution is that 
of a random n-cyclation. Second, we establish the asymptotic behaviors of the expectations 
ExfM^] and ExfT^] of the lengths Mg and Tg of the longest and shortest cycles, in a random 
2;-cyclation as 2; — > 1. Finally, we apply Tauberian theorems to deduce from these results 
the asymptotic behaviors of Ex[M„] and Ex[T„] as n — >■ 00. 

For the first step, let 2; be a real number in the range < 2; < 1. We shall define a ran- 
dom z-cyclation as follows. First, let ti, t2, • be independent Poisson-distributed 
random variables with means 2;/2, . . . , 2'/2Z, . . . , respectively. Next, define the ran- 
dom variable v by 

l>i 

Finally, choose a zv-cyclation with cycle structure {li, L2, ■ ■ ■ , ii^), with all 

n>^i,i2,...,i.]]= , (3.1) 

n (20^'^/! 

\<1<V 

cyclations having this cycle structure being equally likely. (Since n will always denote an 
integer, while z never will, no confusion between the notions of random n-cyclations and 
random 2;-cyclations should be possible.) 
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The number k = X];>i ii of cycles in a random 2;-cyclation is a sum of independent 
Poisson-distributed random variables, and thus is itself Poisson-distributed, with mean 



1>1 
log 



2 °l-z 

Of more importance for us will be the distribution of the random variable i/. For any 
cycle structure (h, Z2, . . . , ^^, • • •) (where we agree to identify cycle structures that differ 
from one another by appending or deleting any number, finite or infinite, of Os at the 
right-hand end), we have 

Pr[(ii, L2,...,Li,...) = {ii,i2, ...,ii,...)]= „ , (3.2) 



n 



ill 



l<l<n 

where 

n = (3.3) 

l>i 

Thus we have 

Prb = n] = (1-^)1/2^^ y — — i , (3.4) 

l<l<n 

where the sum is over all cycle structures (zi, Z2, . . . , i^, . . .) satisfying (3.1). To evaluate 
the sum in (3.4), we sum (2.3) over all cycle structures satisfying (3.1). The result is 

(2n-l)!! = n!2^ V — — ^ . 

l<l<n 

Combining this with (3.4) yields 

Pv[u = n] = {l-zy/'z^ (2n -l)!! 

Thus ly has a negative-binomial distribution: the generating function for i/ is 



MO 



l-z 
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Finally, we consider the conditional distribution of a random 2;-cyclation, given the 
event v = n. By comparing the probability (3.2) of a given cycle structure with the number 
(3.1) of equally likely cyclations with that cycle structure, we see that all (2n — 1)!! n- 
cyclations have equal probability (1 — z)^/'^ z'^/nl 2"'. Thus, the distribution of a random 
z-cyclation, conditioned on the event v = n, is that of a random n-cyclation. 

In particular, from (2.5) we obtain 



Ex[M^] = ^ Pr[zy = n] Ex[M„] 

n>0 

= (1--)"^E£(!")ex[M„]. (3.6) 

n>0 ^ ^ 



The next step is to determine the asymptotic behavior of Ex[Mz] as z ^ 1. We begin 
by determining the probability of the event Mg = I. For I >1, define 



^, 2^' 

l<m</-l 



with the understanding that ti = and 



2m 

l<m<oo 

= o log 



2 °l-z 

The event = I occurs if and only if (1) ii > 1, which occurs with probability 1 — 
Q-{ti+i-ti) ^ and (2) = i/+2 = ■ • ■ = 0, which occurs independently with probability 
g-(too-ti+i)_ Thus we have 



Pr[M^ = l] = e-(*--*'+i) - e-(*--*') 
fti+i 

e*^-^dt. (3.7) 

'ti 



f 

Jt, 



From (3.7) we have 

Ex[M,] = J]/Pr[M, = Z] 
l>i 

l>i -^^i 
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Making the substitution 

too-t = E{x)/2, (3.8) 

we obtain 

Ex[M,] = -5^/ / dx, (3.9) 

^ 1>1 -^^l ^ 

where xi is the unique solution of the equation 

too-ti = E{xi)/2. 

(The existence and uniqueness of x;, together with the inequahties < xi < a;2 < • ■ • < 
xi < xi^i < • • • < Xoo = oo, foUow from the fact that as I increases from 1 to oo, too ~ ti 
strictly decreases from too to 0, while as x increases from to oo, E{x)/2 strictly decreases 
from oo to 0.) Thus (3.9) may be rewritten 

Ex[M,] = 1 r e-^(-)/2-- M dx, (3.10) 

where l{x) denotes the unique integer I such that xi < x < xi^i. 

To determine the asymptotic behavior of the integral in (3.10), we shall need an 
estimate for l{x). For y in the range — mlog2; < y < — (m +1) log 2;, we have 

— (m + l)log2; ^ y ^ —mlogz 
Integrating over y in an integral of length — log z yields 



T</.: 



-(m+l)log2 g-y 



, 1 , dy < — , 

m+1 J-rnlogz y ™ 

and summing over m yields 

E{-nogz) < - ti < E{-{1 + l)logz)., 

which is equivalent to 

— {I — 1) log 2; < xi < —Hog 2;. 

Since — log 2; = (1 — z) + 0((1 — z)^) as 2; — > 1, we have 

l{x) _ 1 
X 1 — z 

in this limit. 



+ 0(1) (3.11) 



13 



We shall also need an estimate for xi. From the formula 



E{x) = — log X — ^ + 0{x) 

— (see Lebedev [L], (3.1.5)), and the definition 

E{xi)/2 = too-ti 

= o log 



2^1-^' 
we obtain 

x^^e-'^{l-z) + 0{{l-zf) (3.12) 

as 2; — > 1. 

Applying the estimates (3.11) and (3.12) to (3.10), we have 

Ex[MJ = - £ e-W/-« — + O (£ d.) 



Since 



/■oo pOO pXl 

/ e-^(-)/2--da;= / e-^^^^^'^-^ dx - e'^^^^l^-^ dx 

Jxi Jo Jo 

nOO 

= / e-^(^)/2-^da; + 0(a;i) 
io 

/■oo 

= / e-'^^'''^/^-''dx + 0{l-z) 
Jo 

(the integrand being bounded by 1), and 



/ 

J Xl 



dx = O [ log — 



X \ X\ J 

= o(log^ 



(the integrand being bounded by e ^ for x > 1 and by 1/x for x < 1). Formula (3.13) 
gives the desired estimate for Ex[M2]. 

The final step is to deduce the asymptotic behavior of Ex[M^] as n — >^ 00 from that 
of Ex[M;2] as 2; ^ 1. This will be done through the use of Tauberian theorems, which 
require "side conditions" for their application. In the case at hand, these side conditions 
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require that the coefficents ^ (^^) Ex[M^] in (3.6) be non-negative, which is trivial, and 
non-decreasing in n, which is equivalent to 

(2n + 1) Ex[M„+i] > (2n + 2) Ex[M^]. (3.14) 

To verify this condition, we start with a random n-cyclation C and randomly insert an 
(n + l)-st interval /, obtaining a random (?i + l)-cyclation. This cannot decrease the length 
of the longest cycle, and if the insertion puts / into a longest cycle of C (which happens 
with probability at least 2Ex[M„]/(2n -|- 1)), the length of the longest cycle increases by 
1. Thus we have 

Ex[M„+i] > Ex[M^] + Ex[M^], 

or equivalently 

(2n + 1) Ex[M„+i] > (2n + 3) Ex[M„], 

which is even stronger that (3.14). 
From (3.6) and (3.13) we have 

MM,] 



n>0 ^ ^ 



Ex[M„ 



(1 - ^)V2 

2 70 



(1 - ^)3/2 



as ^ 1. Since the coefficients ^(^^)Ex[M„] are non-negative, the final Tauberian 
theorem in Section 7.5 of de Bruijn [B] can be applied with (7 = 3/2) to give 

0<n<m ^ ^ ^ JU 

as m — > 00. Since the coefficients ^ (^^) Ex[M„] are non-decreasing in n, the summatory 
analog of the Tauberian theorem in Section 7.3 of de Bruijn [B] can be applied (with 
a = 1 /2) to give 

l(^;)Ex[M„l^-l,/~e-(«)/-d.„'/^ 

as n — > 00. (This summatory analog is in fact a corollary of the theorem in Section 7.3 of 
de Bruijn [B]: given a non- negative and non-decreasing sequence /„, define f{t) for t > 
by /(^) = /n for n < t < n + 1; then f{t) is also non-negative and non-decreasing; if 
Fm = Eo<n<m ~ (« + 1)~^ rn''+\ then we also have F{t) = f{x) dx^{a + 1)-^ r+i; 



15 



the theorem cited then imphes that f{t) ~ t", and thus we also have /„ ~ n'*.) Finally, 
since 

1 /2n\ 1 



we obtain (1.5) as desired. 

We now turn to the expectation Ex[T„] of the length of the shortest cycle in a 
random n-cyclation. Using the notion of a random z-cyclation as before, we have the 
analog of (3.6): 

Ex[T,] = ^Pr[i. = n]Ex[T„] 



n>0 



= (l-)"'E£(r)Exlr„l. (3.15) 

n>0 ^ ^ 

Our next step is to determine the asymptotic behavior of Ex[T2] as 2; ^ 1. We begin 
by determining the probability of the event — I- The event = / occurs if and only 
if (1) i^i > 1, which occurs with probability 1 — e~*^*'+i and (2) 61 = i2 = • • • = 0, 
which occurs independently with probability e~*' . and therefore 



= /I = - 



Pr[T^ = l\= e-*' - e 



This yields 

Ex[T,]=^/Pr[T, = Z] 

i>\ 



1-^ /"^i+i 



l>i 

Again making the substitution (3.8), and proceding as before, we obtain 

,1/2 rxi + i E{x)/2-x 



J X\ 



2 Jxi ^ 

Applying the estimates (3.11) and (3.12), we have 

1 r'^ 1 / r^o pE{x)/2-x \ 

ExK] = i e-<')/-' + O ((1 - .)'/^ I ^ ..) 

= i f e-(^)/-^ ^j-L^ + O ((1 - .)'/^ log ^) . (3.16) 
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Formula (3.16) gives the desired estimate for ExfT^]. 

Our final step is to deduce the asymptotic behavior of Ex[T„] as n — >^ oo from that 
of Ex[T2] as 2; — > 1. This will again be done through the use of Tauberian theorems. 
In this case, the side conditions require that the coefficents ^(^^)Ex[T^] in (3.15) be 
non-negative, which is trivial, and non-decreasing in n, which is equivalent to 

(2n + 2) Ex[T„] < (2n + 1) Ex[T„+i]. (3.17) 

To verify this condition, we start with a random (?2 + l)-cyclation C and delete the (n + 1)- 
st interval /, obtaining a random n-cyclation. If / is alone in a cycle of length 1, which 
happens with probability l/(2?i + 1), the shortest cycle is that of a random n-cyclation. 
With the remaining probability 2n/(2n + 1), the shortest cycle is that of a random (n + 
l)-cy elation, except that if / is taken from a shortest cycle in C, which happens with 
probability at least 2Ex[Tyj_|_i]/(2n -|- 1), the length of the shortest cycle is decreased by 1. 
Thus we have 

1 2n 2 

or equivalently 

nEx[T„] < (n-l)Ex[r„+i], 

which is even stronger than (3.17). 
From (3.15) and (3.16) we have 

z"" f2n\ ^ 1 _ Ex[T^ 

n>0 



(1-^)1/2 



1 ^..^/o „ , 1 



2 ./o 1-z 



as 2; — > 1. Since the coefficients Ex.[Tn] are non-negative, the final Tauberian theo- 

rem in Section 7.5 of de Bruijn [B] can be applied with (7 = 1) to give 



0<n<m 

as m ^ 00. Since the coefficients ^(^^) Ex[Tn] are non-decreasing in n, the summatory 
analog of the Tauberian theorem in Section 7.3 of de Bruijn [B] can be applied (with a — 0) 
to give 
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as n — > oo. Finally, since 




) 



7ri/2ni/2' 



1 



we obtain (1.6) as desired. 

4. Conclusion 

We have determined the distribution of the number of cycles, and the expectations of 
the lengths of the longest and shortest cycles, in a random n-cyclation. There are many 
directions in which further results might be found. As was done for random permutations 
by Shepp and Lloyd [S] , one might seek the expectations of the lengths of the r-th longest 
and r-shortest cycles (these are presumably asymptotic to appropriate constants times n 
and n^/^(logn)'^~^, respectively), or more generally the m-th moments of these quantities. 
One might also try to determine the actual distributions of the length of the shortest cycle 
(this presumably tends to the limiting distribution Pr[T„ = /] e~^'/^ — e~^'+i/^ as 
n ^ oo with / fixed, where e~^^l'^ —e~^^+^l'^ ~ e~'''/^/2/"^/^ as / oo), or of the normalized 
length Mrtjn of the longest cycle, which presumably tends to that of the largest component 
in a Poisson-Dirichlet distribution with parameter 1/2. (Poisson-Dirichlet distributions 
were introduced by Kingman [Kl]. The Poisson-Dirichlet distribution with parameter 1/2 
has been shown by Aldous [A] to govern the normalized size of the largest component in 
the graph of a random mapping from a set to itself.) 
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